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ABSTRACT 

We use the existing catalog of Damped Lyman-Alpha (DLA) systems to place 
constraints on the amount of evolution in the baryonic content of galaxies and on the 
value of the cosmological constant. The density of cold gas at redshifts z = 3 db 1 is 
obtained from the mean HI column density of DLAs per cosmological path length. 
This path length per unit redshift is in turn a sensitive function of the vacuum density 
parameter, We compare the total inferred mass of cold gas at high redshifts to 
that observed in stars today for cosmologies with Q m + £l\ = 1, where O m is the 
matter density parameter. We define r\ to be net fraction of the baryonic content of 
local galaxies which was expelled since z = 3, and use Bayesian inference to derive 
confidence regions in the (rj, Q\) plane. In all cosmologies we find that -n < 0.4 with at 
least 95% confidence if < 25% of the current stellar population formed before z = 3. 
The most likely value of rj is negative, implying a net increase by several tens of percent 
in the baryonic mass of galaxies since z = 3± 1. On the other hand, recent observations 
of high metal abundances in the intracluster medium of rich clusters (Loewenstein & 
Mushotzky 1996) require that metal-rich gas be expelled from galaxies in an amount 
approximately equal to the current mass in stars. Based on our results and the low 
metallicity observed in DLAs at z ;> 2, we infer that more than half of the baryonic 
mass processed through galaxies must have been assembled and partly expelled from 
galaxies after z = 2. We expect our constraints to improve considerably as the size of 
the DLA sample will increase with the forthcoming Sloan Digital Sky Survey. 
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1. Introduction 

To date, some ~ 80 Damped Lyman- Alpha (DLA) absorption systems have been identified in 
the spectra of high-redshift QSOs. The observed absorption troughs indicate concentrations of 
neutral gas, with large HI column densities, N ;> 10 20 cm" 2 . There is considerable evidence that 
these objects are associated with the progenitors of present-day galaxies. This has been confirmed 
in several cases by direct imaging of QSO fields (Steidel et al. 1994, 1995, 1996; Djorgovski et 
al. 1996; Le Brun et al. 1996). There are various indications that DLAs are associated with 
young galaxies. The abundances of metals at low ionization stages in DLAs are comparable to 
those found in disk galaxies (Wolfe 1995). The metal absorption line profiles show a leading-edge 
asymmetry (Wolfe 1995; Prochaska & Wolfe 1997) and are shifted relative to the Lya emission 
redshift (Lu, Sargent, & Barlow 1997), as expected from a rotating thick disk with circular 
velocities comparable to those seen in spiral galaxies. Furthermore, observations of redshifted 
21-cm absorption and emission from DLAs indicate disk-like structures of galactic dimensions 
(Briggs et al. 1989; Wolfe et al. 1992). However, recent HST images (Le Brun et al. 1996) 
have revealed that DLA galaxies span a wide range of morphological types. The existence of 
a substantial galaxy population at redshifts z ^ 2 is consistent with most CDM models, which 
predict that galaxies should form by z « 2 — 3 (Frenk et al. 1996, and references therein). 

The HI column densities in DLAs can be derived from the equivalent widths of the observed 
absorption features. The mean HI column density along a random line of sight may then be 
summed up and divided by the absorption path length probed to obtain the comoving spatial HI 
mass density phi in galaxies at a given redshift. Since DLAs dominate the HI mass in the universe 
(Lanzetta et al. 1995, and references therein), their statistics can be used to infer the evolution of 
the comoving HI density from high redshifts up to the present epoch. Recent work (Lanzetta et al. 
1995; Wolfe et al. 1995; Storrie-Lombardi, McMahon, & Irwin 1996) has shown that, for universes 
with a zero cosmological constant (A = 0), the inferred comoving total gas density p g = 1.3/5hi 
(including HI and He) at a redshift z ~ 3.5 is comparable to the stellar mass density observed 
in present-day galaxies. This was seen to be consistent with a simple "closed-box" picture in 
which galaxies had formed by z = 3.5, and the neutral gas was subsequently converted into stars, 
while the total baryonic mass (gas + stars) is conserved. Note that the closed-box model does 
not assume that the baryonic masses of individual galaxies are conserved but rather that the total 
baryonic mass of all galaxies is not evolving with time. This model allows for mergers which 
conserve the total mass of all of the galaxies involved. 

Evolutionary models may be generalized to include a net infall of gas from the intergalactic 
medium, or a net outflow of gas from DLA galaxies. The closed-box assumption represents the 
simplest evolutionary model, but in general we do not expect the total baryonic mass in galaxies 
to be conserved. In particular, the intracluster medium (ICM) in rich clusters of galaxies contains 
an amount of iron which is approximately equal to the amount locked in stars in these clusters 
(Renzini et al. 1993; Loewenstein Sz Mushotzky 1996). The natural explanation is that the metals 
observed in the ICM were produced inside of galaxies and subsequently expelled into the ICM by 
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super nova-driven winds. In this paper, we examine whether the DLA data are consistent with the 
substantial outflow needed to account for this observation. 

In the subsequent discussion, we also allow A / 0, and explore the family of models with 
£l m + = 1, where O m and are the mean present-day cosmic mass densities in matter 
and vacuum energy, respectively, in units of the critical density. The absorption path length 
corresponding to a given redshift interval depends on the assumed cosmological model; it is 
shortest for flat, A = universes, longer for open, A = universes, and longer still for flat, A^O 
models. Because the absorption path length is longest for A-dominated cosmologies, we expect 
the inferred value of p g at high redshifts to be the smallest, and hence the deficit with respect to 
the present-day stellar density to be the largest in these cosmologies. This approach is similar to 
methods which constrain based on the statistics of gravitational lensing (Kochanek 1996, and 
references therein). In closed-box models, one may investigate the constraints that can be placed 
on Q\ by requiring, for some suitably chosen z > 2, that p g (z) be comparable to the present-day 
mass density in stars, p s (0). However, a net infall of gas between the high-redshift epoch and z = 
can bring the DLA data into reasonable agreement with high-A cosmologies. For our purposes, 
it is sufficient to parameterize the amount of accretion or expulsion as rj = Pb,gai( 2; )/Pb,gai(0) — 1) 
where Pt>,gai = Pg + Ps is the total baryonic density in galaxies, and /9b,gai(0) ~ Ps(0). In this paper 
we explore the constraints on both galaxy evolution and cosmology by constructing confidence 
regions in the (rj, plane. To simplify our analysis, we make the reasonable assumption that the 
distribution of DLAs in HI column density N and redshift z, F(N,z), is separable into functions 
of N and z over a suitably small redshift interval. This allows us to fit the column density 
distribution separately from the extraction of constraints on galaxy formation or the underlying 
cosmology [the latter being exclusively related to the redshift dependence of F(N, z)\. 

The value of p s (0) is calculated by multiplying the local luminosity density by the mean 
mass-to-light ratio. Recent imaging of DLA galaxies (Le Brun et al. 1996) indicates that, while 
some are spirals, a significant fraction of these objects have irregular morphologies, which may 
indicate that a substantial amount of merging was taking place at high redshifts. Since mergers 
may result in the formation of elliptical galaxies, we include all galaxy types in our calculation 
of the local luminosity density, and hence p s (0). The assumption that underlies our discussion 
is that star formation requires cold HI gas, which must be represented in a fair sample of all 
Lya absorption systems, irrespective of whether the star formation process occurs in spirals or in 
ellipticals. 

In §2.1 we show how the comoving HI density is inferred from the DLA sample, and how 
sensitive it is to the underlying cosmology. Section 2.2 adds the impact that evolution in the HI 
content of galaxies might have on our analysis. In §3 we discuss the statistical methods used to 
compare the data to the theoretical predictions for p g (z). In §4 we present the derived confidence 
intervals for our constraints on the evolution of galaxies and the cosmological constant. Finally, 
§5 summarizes our conclusions. 



- 4 - 



2. Evolution Of The Comoving HI Density 

In order to extract useful constraints from the data, we must predict some observable property 
of the DLA sample, and show explicitly how our prediction depends on our assumptions about 
cosmology and evolution. In §2.1, we show how, for a given cosmology, the comoving HI density 
pm(z) is related to the total DLA column density along a line of sight. In §2.2, we introduce a 
parameter which characterizes the amount of evolution in the HI density. 



2.1. Inferring pm(z) from the DLA Sample: Effects of Cosmology 

The comoving HI density in DLA systems at a redshift z is inferred by calculating the mean 
HI column density in a proper length interval cdt along a line of sight, and dividing the result by 
(c/Ho)dX = (1 + z) 3 cdt. Since the absorption path length element dX corresponding to a given 
redshift element dz depends on the cosmological parameters f2 m , Q\ (the mean present-day cosmic 
mass densities in matter and vacuum energy, respectively, in units of the critical density), the 
inferred value of the comoving HI density will depend on the assumed geometry of the universe. 
Let F(N, z)dNdz be the mean number of DLAs along a line of sight with HI column densities 
between N and N + dN and redshifts between z and z + dz. Note that F(N, z) is different from 
the function f(N,z) usually encountered in the literature; the latter is conventionally defined such 
that f(N, z)dNdX gives the number of DLAs with with column densities between N and N + dN 
and absorption distances between X{z) and X(z) + {dX/dz)dz. The inferred comoving HI density 
is given by 

where toh is the mass of a hydrogen atom, and N m \ n = 2 x 10 20 cm~ 2 is the minimum HI column 
density included in the sample. For a cosmological model with density parameters (f2 m ,OA) and 
an open or flat geometry, the absorption path length element dX is given by 

dX = {l + zfdz 

VOn(i + zf + (i - n m - n A )(i + z y + n A 



pm(z) 



F(N, z)NdN, (1) 



For a flat matter-dominated universe with J7 m = 1 and f^A = 0, dX = (1 + z) l l 2 dz; for a 
low-density universe with f2 m = and = 0, dX = (1 + z)dz; and for a flat, A-dominated 
universe with Q m = and J7a = 1> dX = (l + z) 2 dz. Hence, because the path length corresponding 
to a given redshift interval dz is longest for A-dominated cosmologies, the Ohi(^) value inferred 
from equation (|lj) is smallest for large Oa- In this paper we will only consider the family of flat 
cosmologies with Q m + Oa = 1 ; and will write all cosmology-dependent expressions in terms of 
the single adjustable parameter J1a- 



The observed distribution F Q y }S (N, z)dNdz, defined as the total number of DLAs observed in 
the sample with column densities between N and N + dN and redshifts between z and z + dz, 
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depends on the sensitivity of the QSO sample to the detection of a DLA feature at redshift z. In 
the absence of obscuration of background QSOs by dust in the DLAs, 



F ohs (N,z)=g(z)F(N,z), 



(3) 



where 



9W 



(4) 



Here, H{x) the Heaviside step function, m is the total number of QSOs in the sample, and 
(zf 1111 , zf 1 ^) is the redshift window over which the observations are able to detect a DLA feature 
in the spectrum of the ith QSO (depending on the redshift of the QSO and the response of the 
detector). Thus, g(z) is the number of lines of sight for which a DLA feature is detectable at an 
absorber redshift z. We therefore have 



Pm(z) 



( H mn \ 



V /(1_0 A )(1 + Z )3 + ^ A 
(1 + z) 2 



1 



g{z) Jn, 



F ohs (N,z)NdN. 



(5) 



Equation (H) expresses the inferred value of pm(z), factored into functions that depend exclusively 
on Qa, which we treat as a free parameter, and the properties of the QSO and DLA samples, 
which are fixed by observations. 



2.2. Predicting />hi (^): Effects of Evolution 

Given a specified galaxy evolution picture, the evolution of the gas density is obtained by 
solving the equations of cosmic chemical evolution (e.g. Pei & Fall 1995): 

Pg + Ps = Pb.gal, (6) 

p g Z - yp s = (Zf - Z)p\ g£k \. (7) 

Here, the dot denotes a time derivative; p g is the total gas density (note that p g > pm, since p g 
includes HII, H2, He, and heavier elements); p s is the mass density in stars; /9b,gai is the net rate 
at which the total baryonic mass density changes (pb.gai > corresponds to a net accretion of 
material from the intergalactic medium, while Pb,gal < corresponds to a net expulsion of material 
from galaxies); y is the mean stellar yield (mass fraction of elements heavier than He produced in 
stars), averaged over the stellar initial mass function (IMF); Z is the metallicity of the gas (mass 
fraction of elements heavier than He present in the gas) in galaxies, and Z{ is the metallicity of 
the infalling gas. The simplest solution is the "closed-box" solution, which has no net accretion or 
outflow (pb.gal = 0): 

Z{z) 

y 

where we have assumed Z(oo) = 0. Solutions which include accretion or outflow have been 
identified by Pei &: Fall (1995) for the case where p\ )ga \ oc p s . 



Pg(z) = p g (oo)exp 



(8) 
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For simplicity, we will assume that p g is constant within a sufficiently narrow redshift interval 
around z = 3. Although the minimum width of this interval is limited to Az ~1— 2 by the 
current size of the DLA sample, one can imagine that forthcoming DLA surveys will allow one to 
narrow it much more in the future. In principle, one could have considered the case of a constant 
nonzero rate of change p g ; however, even with Az ~l-2 this produces results which do not differ 
appreciably from the case where p g is a constant, for a range of reasonable values of p g . 

Any deficit in the gas density p g (z) at z = 3 relative to the total present-day baryonic mass 
Pb,gai(0) = Pg(0) + p s (0) ~ p s (0) will be due to a combination of two effects: (1) some material 
may have accreted onto existing galaxies or assembled into new galaxies since z = 3, and (2) some 
star formation may have already occurred by redshift z = 3, depleting part of the gas. Effect (2) 
is expected to be sub-dominant based on the low metallicities Z ~ O.1Z observed at redshifts 
z ^ 2 (Lu et al. 1996) and the relatively small star formation rates observed at such high redshifts 
(Madau 1996). Nevertheless, we will lump the two effects together, and will parameterize the 
combined contribution to the deficit by defining 

1 = ^-1, (SO 

where 1 + rj is the fraction of Pb,gai(0) which was present at a redshift of 3. (We choose z = 3 as 
the fiducial redshift for the comparison, but since we assume that p g =constant over an interval 
containing z = 3, we could just as easily choose any other redshift in this interval for the definition 
of rj.) Hence, r\ < corresponds to a net accretion and rj > corresponds to a net expulsion of 
material since z = 3. Star formation prior to z = 3 is expected to produce at most ~25% of p s (0) 
(approximately the fractional area under the the curve p s (t) inferred from Madau 1996, corrected 
for high- A cosmologies) . Let / be the fraction of the total present-day mass in stars which were 
produced by z = 3 ± 1; then, with Pb,gai(0) ~ p s (0), we have 

Pg (3) = (l + v -f) Ps (0)- (10) 

Finally, we neglect the contributions of ionized hydrogen, molecular hydrogen (see Ge & 
Bechtold 1997), as well as metals, to p g at redshifts z ^ 2, but include helium, 25% by mass, 
resulting in a mean molecular weight of 1.3mn- We therefore use p g = 1.3pni- 



3. Statistical Methods 

Next we describe the methods used to compare the data to the model prediction pm(z) for the 
mean HI density at high redshift, at given values of r] and In §3.1, we construct the likelihood 
function, and in §3.2, we use Bayesian analysis to derive confidence regions in the (77, Qa) plane 
from the likelihood function. 
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3.1. Constructing a Likelihood Function 

The likelihood function gives the probability of obtaining a particular data set, assuming 
the truth of a specified model. The model should include predictions for both the expected 
value of the quantity in question and the distribution of measurements due to statistical (e.g. 
Poisson) fluctuations about the expected value. The most commonly used likelihood estimator in 
the literature is the x 2 statistic, which is simply proportional to the logarithm of the likelihood 
function for normally distributed errors. Its widespread use is due to two convenient facts: the 
tendency, due to the Central Limit Theorem, of sums of many random variables to be normally 
distributed; and the fact that the distribution of the x 2 statistic for normally distributed errors 
is known, which makes it easy to calculate confidence regions. However, the x 2 statistic is not 
appropriate for our analysis here. To apply the \ 2 statistic, we would need to collect the data 
into redshift bins, and estimate the value of /?hi in each bin by summing up the column densities 
and dividing by the total absorption path length probed in the bin. However, given our small 
sample size (only 73 objects in total), such experimentally-measured values of pui will not be 
normally distributed. In fact, the values we measured from Monte-Carlo-simulated data sets show 
a significant skewness in their distribution, and so we are not justified in using the x 2 statistic to 
calculate confidence regions. Moreover, this approach introduces arbitrariness in the choice of bin 
size and also loses information about the DLA sample by binning the data. We have therefore 
chosen to construct a likelihood function which makes use of the unbinned data, and which does 
not rely on the assumption of normally distributed errors. 

The starting point for constructing our likelihood function is to note that any prediction for 
PHI (z) can be related to the observed distribution of DLAs in column density and redshift using 
equation (||). Since we assume that pm(z) is approximately constant over some narrow redshift 
interval around z = 3, we may use equation fllOD, and rearrange (|5|) to write 



F ohs (N,z)NdN 

N min 



(— 



(l + z) 2 g(z) 



Ps0 (i + v-f), (ii) 



where p s o = p s (0). Thus, given an evolution model pm( z ) an d a cosmological model Q\, we have a 
definite prediction for a property of the DLA sample, namely the total column density as a function 



of redshift [given by the left hand side of equation ([11])] . Note that (11) gives the expected value 
of this quantity; it is not obvious how the values measured from hypothetical data sets should be 
distributed about this expected value (as mentioned above, given the small sample size, the values 
in each redshift bin will not be normally distributed). Specifically, the distribution of measured 
Phi (z) values depends on the shape of i ? bs(A r , z) as a function of HI column density N, and will 
be skewed toward smaller values if F \, s (N,z) is dominated by low— iV systems, and vice- versa. 
Unfortunately, our evolution models pm(z) predict only the first moment of F Q ] :)S (N,z), not the 
distribution itself. However, the DLA sample does tell us something about the full distribution; in 
particular, the entire sample is well-fitted by a so-called gamma distribution (Storrie-Lombardi, 
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Irwin, & McMahon 1996): 



F ohs (N, z)=F it [ — J exp ( -— ) , (12) 



where -F* and iV* are functions of z and 7 =const. If we specialize to the case where iV* is 
a constant, then F+ contains all of the z-dependence, and is proportional to dX/dz (i.e., all 
of the redshift dependence is due to the cosmological geometry). In this case, we may rewrite 
equation ([□]) as 



n(z-,n A ,r],N ; 



a,vg) 




(l + zfgiz) 



\H m n J \N avg l ^/(l-n A )(l + z yi + n A 



(13) 



where 



_ JZ in F UN,z)NdN _ r(2- 7 ,iV min /iV,) 

vs " ^(iv, z)dN iV * ' r(i - 7 , iWi\y 

is the mean column density in the distribution, with T(a, x) = f°° t a ~ l e~ t dt the incomplete gamma 
function. Note that equation (|l~3|), with -/V avg independent of redshift, applies for any distribution 
F Q b s (N,z) which is separable into functions of N and z. The assumption of separability is 
particularly appealing if only a small fraction of the available HI is depleted from DLA galaxies 
during the redshift range under consideration. Suppressing the dependence on Qa> Vj an d -Wavg for 
brevity, the quantity n(z) defined in equation (|l~3|) denotes the redshift distribution of absorbers; 
the number of systems observed with redshifts between z and z + dz is n(z)dz. 

For the moment, assume that we know the shape of F Q ^ S (N, z) a priori, and hence that we 
know -/V av g from equation flli]) . [We will address the question of how to incorporate properly our 
incomplete knowledge of F Q y }S (N, z) from the data into our analysis in the next section.] Then we 
may construct a likelihood function from equation (^) as follows. Suppose we wish to compare 
the data to our prediction over a range of redshifts (z m - m , z max ). If we divide this range into 
sufficiently small intervals dz, such that n{z)dz <C 1, then there will be at most one object in each 
such interval. Then the probability of finding no objects in a given interval at a redshift z is given 
by the Poisson distribution, P(0) = exp [— n(z)dz\\ similarly, the probability of finding one object 
is -P(l) = n(z)dz exp[— n(z)dz]; and the probability of finding more than one object is negligible, 
X]^2 ~ 0{[ra(z)cfz] 2 } <C 1- Then the likelihood, or conditional probability of obtaining a 
particular data set D from a distribution n(z) predicted by ( |l3| ) is 

{m ob s ~) (m nonc "| 

Yl n(zi)dz exp[-n(zi)dz] \ < J| exp[-n(zj)dz] > , (15) 

where m Q b s is the number of DLA systems observed, z% is the redshift of the i-th system, m none 
is the number of intervals dz in our range (-Smiru ^max) which have no DLAs in them [note that 
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none — (-^max ^min) / ' dz ], zj is the redshift of the jth such interval, and we have suppressed 
the dependence of n(z) on the parameters (f^A, rj, -/V avg ) for brevity. For dz <C 1, equation ( |T5| ) 
may be written as 





~ m obs 




P 2max 


P(D\n A ,rj,N avs ) = 


Y[ n(zi)dz 


exp 


— / n(z)dz 




. i=l 




J Z m [ n 



This probability clearly depends on the chosen size of the interval dz, and shrinks to zero as 
dz — > 0. This reflects the fact that, as dz shrinks, the number of possible distinct outcomes D 
increases, so the probability of obtaining any particular data set D goes to zero. In the next 
section, we will use Bayesian inference to construct confidence intervals from the likelihood 
function fll6|). 



3.2. Computing Confidence Regions with Bayesian Inference 

Equation (|l^) gives the probability of obtaining our DLA sample as a random realization 
of the redshift distribution (|l~3|), assuming the truth of a particular cosmology S7a and evolution 
model rj, and assuming knowledge of the expected average column density N avg in the sample. 
However, our evolution model for pm(z) does not make any predictions about the value of iV avg ; 
we must make use of the DLA sample to extract information about the range of reasonable values 
for N avg . In addition, the value of p s o is uncertain since it is related to the uncertain mass-to- light 
ratios of present-day galaxies. Both of these uncertainties in our knowledge may be rigorously 
incorporated into our analysis if we use Bayesian inference. 

Bayes' Theorem follows trivially from the axioms of probability and the definition of 
conditional probability. The theorem relates P(&a, ?]\D, iV avg ), the conditional probability 
distribution for values of the model parameters given the observed data and an assumed value of 



Aavg, to the likelihood P(D\Q\, rj, -/V avg ) [cf. eq. (16)]. The conditional probability of obtaining 



the observed data set as a random realization of the model with particular parameter values is, 

p(o A , n\D, iv avg ) = a • p(n A , 7?|iv avg ) p(d\q a , n, iv avg ). (17) 

Here, P(J}\,rj\N aV g) is the prior probability distribution for the two parameters, which, in the 
absence of any previous data, we take to be uniform and independent of -/V avg ; and A is a 
normalization constant which ensures that J P($l\,rj\D, N avg )dfl\dr] = 1. Note that the value of 
dz in equation (|l6|), which may be taken to be arbitrarily small, is absorbed into A. Unfortunately, 
our a priori knowledge does not tell us the precise value of iV avg . However, we may include as 
an additional component of our model the assumption that the data are well fitted by a gamma 
distribution (|l^). By fitting a functional form (^) to the data, we may obtain a prior distribution 
P(N SbVg ) of reasonable values of A^ avg . The procedure for obtaining -P(A r avg ) is identical to the 
procedure we employ to get P(Cl\, rj\D, iV avg ). Note that the gamma distribution is not the only 
plausible functional form to fit to the data; however, it provides a better fit than a single power 
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law (Storrie-Lombardi, Irwin, & McMahon 1996), since there is a significant break in the observed 
slope of F(N,z). Any other reasonable two-parameter distribution (e.g. a broken power law) will 
yield similar results to the gamma distribution. 

The sum rule of probability theory allows us to "marginalize" the parameter iV av g by 
integrating equation (|17]) over values of iV avg , weighted by P(N svg ): 



P(n A , v \D) = A- / P(N ; 



•avg 



) P(D\n AlV ,N avg ) dN avg , 



(18) 



where we have assumed that P(Q\,rj\N avg ) is uniform, and have absorbed all constants into A 
such that P(Q\,r]\D) is still normalized to unit area. We may incorporate the uncertainty in p S Q 
in a similar way. As will be seen in §4.1, the uncertainty in p S Q is approximately Gaussian. Then, 
making the dependence on p S Q explicit, we finally have 



p(n A , v \D) = A- 



dN : 



avg 



dp sQ -P(A r av g ) exp 



{PsO - Pso) 

2a 2 



P(D\n A ,rj,N avs , Ps0 ), (19) 



where once again the normalization has been absorbed into A. We will use equation (19), 
substituting equations (13) and (16) for P(D\il\,rj, N ayg , p s o) and obtaining P(N avg ) from a fit to 
the data, to compare the data to the models and calculate confidence regions. 



4. Results: Application to the DLA Sample 

We include in our sample all DLA systems whose redshifts and HI column densities (or, in a 
few cases, equivalent widths) have appeared in the literature (Wolfe et al. 1986; Lanzetta 1991; 
Lanzetta 1995; Wolfe et al. 1995; Storrie-Lombardi, McMahon, & Irwin 1996), for a total of 73 
systems. In the cases where no HI column density has been confirmed, we have calculated it from 
the reported equivalent width using equation (3) in Wolfe et al. (1986). This is the same sample 
used by Storrie-Lombardi, McMahon, & Irwin 1996, with the addition of data from Wolfe et al. 
(1995). In our analysis, we have concentrated on high redshifts (z > 2), and hence have used 
subsets of this sample. 



4.1. Calculating the Present-Day Stellar Density p S Q 

Since our final results depend sensitively on the value and degree of uncertainty in the local 
stellar density p s o, it is important that we obtain the most accurate and precise possible estimate 
of this quantity from the literature. Previous studies (Wolfe et al. 1995; Lanzetta et al. 1995; 
Storrie-Lombardi, McMahon, & Irwin 1996) have employed the value p s o/p c = 2.7 x 10 _3±0 ' 18 /i _1 
(Gnedin & Ostriker 1992), where p c = 3Hq/8ttG is the present-day critical density, and the Hubble 
constant is Hq = WOh km s -1 Mpc -1 . We make use of recent observations to refine this estimate. 
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We compute p S Q by multiplying the local luminosity density of galaxies by their mean 
stellar mass-to-light ratio. Since the mass-to-light ratio is in general correlated with the galaxy 
luminosity, we have 

POO 

p s0 = / 4>(L)T(L)LdL, (20) 
Jo 

where <j){L) is the galaxy luminosity function (LF) and T(L) is the mass-to-light ratio in solar 
units. We have used the luminosity function for the NS112 sample of the Las Campanas redshift 
survey (Lin et al. 1996), which represents the most precise determination of the local LF to date. 
We include all galaxy types, since recent identifications of DLA galaxies (Le Brun et al. 1996) have 
indicated that DLA systems may be associated with a wide range of morphological types. Lin et 
al. (1996) obtain a best-fit Schechter function with the parameters M+ = —20.29 ± 0.02 + 51og/i, 
a = 0.70 ± 0.03, and 0* = (0.019 ± 0.001)/i 3 Mpc~ 3 , where the photometry was done in a band 
very similar to the Cousins R c band. Lin (1997, private communication) points out that these 
Schechter parameters also provide a good fit for the Gunn r-band luminosity function after a 
correction is made from isophotal to total galaxy magnitudes. Using a solar absolute magnitude 
of r = 4.83 (Broeils, 1997, private communication), this yields an r-band luminosity density 
of j r = (1.9 ± 0.1) x 10 8 /i Lq Mpc -3 . For the mass-to-light ratio T(L), we use the relation 
measured by Broeils h Courteau (1996) in the Gunn r-band for the disks of Sbc-type galaxies: 
T(L) = [5.8(L/10 10 L©)° 24 ± l]/t, where the uncertainty is the la deviation and the distribution 
of observed values is approximately Gaussian. Thus, the uncertainty in p s o is dominated by the 
Gaussian uncertainty in T(L). We include all galaxy types in our calculation of p s o, with the 
assumptions that spiral galaxies are well described by maximal-disk models, and that spiral and 
elliptical galaxies have the same stellar mass-to-light ratio. We take the mass-to-light ratio for 
Sbc galaxies as typical for all galaxy types of the same luminosity. Our approach overestimates 
the stellar mass-to-light ratio somewhat if, as some studies suggest (Rix et al. 1997), the dark 
matter accounts for a significant fraction of the total mass in the inner regions of galaxies. Using 
equation (p0|), we obtain 

= (4.0 ±1.0) x Hrt- ' 48 . (21) 

Pc 

Based on the uncertainty in T(L), we assume that the values of p s o are normally distributed with 
a variance a = 1.0 X 10~ 3 /i~ ' 48 . 



4.2. Effects of Dust 

The observed column density distribution of DLA systems, and hence the inferred value of 
Phi, is affected by the presence of dust in the DLA galaxies (Fall & Pei 1993). The dust obscures 
background QSOs, and causes incompleteness in the DLA sample. The presence of dust leads 
to an underestimate of phi from the data; accounting for this fact will lead to better agreement 
between data and predictions for high-A cosmologies. Fall & Pei (1993) showed that the true 
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column density distribution is given by 

F trne (N, z) = F ohs (N, z) exp[/3r(iV, z)), (22) 

where (3 is the power-law slope at the bright end of the QSO luminosity function [<j>(L) oc L~ 
and t is the extinction optical depth, given by 

T < w -*)-*w(iS«SFi)f(iTi)- < 23 > 

Here, k(z) = Pd/Pg is the dimensionless dust-to-gas ratio, £ is the extinction curve, and A e 
is the effective wavelength of the band of the QSO survey. We have used equations ( ]22] ) 
and (23), together with an assumption that the observed distribution F Q ^ s (N,z) may be fit by 



a gamma distribution (|T^), and that k(z) is proportional to the metallicity, to estimate the 
correction to the expected value of pm(z) due to dust obscuration. Following Pei & Fall (1995), 
we assume that k(0) = 0.8, (3 = 2, and £(A) = A#/A for QSO surveys in the P-band. We 
assume that the metallicity at redshifts z = 3 ± 1 is approximately one-tenth of the solar value, 
Z(z = 3 ± 1) = O.IZq. We find that the effect of dust is to reduce the inferred value of phi by a 
factor of ~ 1.5. Our results in the next section will include this correction. 



4.3. Results of Bayesian Analysis 

We have applied equation (|l9|), corrected for dust extinction, to the DLA sample for two 
redshift ranges, 2 < z < 4 and 2.5 < z < 3.5, allowing < Q\ < 1, — 1 < 77 < 1.5. To compute the 
prior distribution P(A r avg ), we first fit a gamma-distribution ( |i~2|) to the data, and use Bayesian 
methods similar to those described in §3.2 to obtain a probability distribution P(7, N+). We fix 
by requiring that the integral of F \^ s (N,z) over all column densities be equal to the total number 
m bs of DLAs observed in the narrow redshift interval under consideration. We then obtain a 
cumulative probability distribution for N avg by integrating numerically the probability P(7, N+) 
inside contours of constant A^avg in the (7, N+) plane, given by equation (|T4]). We differentiate this 
cumulative distribution numerically to obtain the differential distribution P(N avg ). Finally, we use 
equations (|i~3|) and (|i~6|) to obtain the likelihood function P(D\U\, 77, N avg , p s o). Substituting these 
results into equation (|ig|) gives the differential probability distribution P(p,^,rj\D). Confidence 
regions are obtained by integrating this distribution inside contours of constant P(£l\,ri\D). Our 
results do not depend strongly on the value of the Hubble constant; we assume Hq = 70 km s _1 
Mpc -1 . 

In figure la, we show 68%, 95%, and 99% confidence regions in the (Oa, rf) plane, for the 
redshift range 2 < z < 4; figure lb shows the same for the redshift range 2.5 < z < 3.5. In both 
cases, we assume that 25% of the present-day mass density in stars had been assembled by z = 2 
(/ = 0.25), consistent with the star formation rate of Madau (1996). The effect of the sample 
size is evident: the 99% confidence region is 20% smaller for the larger sample (45 objects with 
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2 < z < 4) than for the smaller sample (17 objects with 2.5 < z < 3.5). Note that, for the larger 
sample, the data are inconsistent with a net expulsion of more than half of the gas from galaxies 
(77 > 1) at the 99% confidence level, and also rule out 77 > 0.5 at 95% confidence and i] > 0.1 at 
68%, in all flat cosmologies. One may compare this result with the observational finding that 
the intracluster medium (ICM) in rich clusters of galaxies contains an amount of iron which is 
approximately equal to the amount found in the stellar component of galaxies in these clusters 
(Renzini et al. 1993; Loewenstein Sz Mushotzky 1996). The natural explanation is that the iron in 
the ICM was produced in supernova explosions inside of galaxies, and was subsequently expelled 
into the ICM by supernova-driven winds. If galaxies were fully assembled by a redshift z ~ 4, 
then they should have had twice as much baryonic material at such redshifts than is observed 
locally. Since our analysis excludes the possibility that there was more than 1.5 (1.75) times the 
present-day baryonic density at 95% confidence for the larger (smaller) samples, it suggests that 
(1) infall of material into existing galaxies or formation of new galaxies has taken place since 
z = 2; or (2) more than 25% by mass of the present-day stellar population formed by z = 2. These 
possibilities will be discussed in more depth in §5. Note also that for high-f^A cosmologies, outflow 
models are strongly ruled out, so it would be more difficult to reconcile the DLA data with the 
intracluster iron observations in an OA-dominated universe. 

If we wish to specialize to a particular cosmological model, we should consider the one- 
dimensional probability distribution P(rj\D, since the sizes of the confidence intervals decrease 
when the number of free parameters is reduced. Figure 2a shows the probability distribution for 77, 
given = (the case corresponding to the most conservative constraints on rf). We consider the 
sample with 2 < z < 4. The solid curves correspond to different assumptions about the amount 
of star formation that took place at z > 3 ± 1: / = 0, 0.25, and 0.5, from left to right. Upper 
limits on 77 derived from these curves are summarized in Table 1. As / increases, the gas density 
observed in DLAs is supplemented by more and more stars, so the data become more consistent 
with the closed-box model. As can be seen from Figure 2a, the data are consistent with modest 
to significant amounts of infall or formation of new galaxies subsequent to z ~ 3. Significant 
expulsion is less likely; for / = 25% the data are inconsistent with 77 > 0.42 at 95% confidence. If 
one allows for half of the present-day stellar population to form before z = 3, then one achieves 
marginal consistency with 77 = 0.92 at 95% confidence. However, in this case the metallicity of 
DLA systems will significantly exceed its observed value of O.1Z at z ^ 2 (Lu et al. 1996). Figure 
2b shows similar results for 17a = 0.7; here, no significant amount of expulsion is viable. Hence, 
it is unlikely that all of the baryonic material seen today in galaxies and the ICM of rich clusters 
was present in DLA galaxies at redshifts z > 2. The dotted curves in Figure 2 are the results 
obtained by considering present-day galaxies + ICM as a whole, with equal mass densities in stars 
and the ICM. Hence, we attempt to account for an amount "Pb,gai(0)" = 2p s o- We assume that 
25% of the present-day mass in stars + enriched ICM (that is, 0.5p s o) was present in the form of 
stars by z = 2. Clearly, only about half of this amount was present in DLA galaxies at z > 2 (see 
Fig. 2); the rest of it must have assembled into galaxies (and some of it subsequently expelled) 
after z = 2. This possibility will be discussed further in §5. 
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The constraints obtained with our method will improve as the catalog of DLA systems grows. 
The Sloan Digital Sky Survey, which is getting underway in 1997 (Gunn & Knapp 1993; see 
also http: / /www. astro. princeton.edu /BBOOK ) , will catalog ~ 10 5 quasars, at least an order of 
magnitude more than the number discovered to date (Loveday 1996). Spectroscopic follow-ups 
on this sample could increase the DLA sample size by 1-2 orders of magnitude. In Figure 3, 
we predict the effect on our results of a more modest increase in the DLA sample size. Figure 
3a indicates how the probability distribution P{rj) changes when the sample size m b s with 
2.5 < z < 3.5 is increased by a factor of 2 or 5. For illustrative purposes, we choose J^a = and 
/ = 0.25. We take into account the observational uncertainty a in the present-day stellar density 
/9 s o, and assume that the additional DLAs have the same column density distribution as the 
existing sample. In this case it is a simple analytical matter to determine P (ry) for a hypothetically 
larger data set. The trend is obvious: as more data are acquired, our measurement of ry becomes 
more precise, and -P(ry) becomes more sharply peaked. Given our assumptions, the mean value 
of ry will remain constant, but the confidence intervals will shrink. For the case shown in Figure 
3a, the 95% upper bound on ry is 0.32, 0.10, and -0.08, respectively, for m b s = 17, 34, and 85. In 
reality, the mean value of ry will shift around as the column density distribution becomes better 
known [i.e. -/V av g in equation (JlJ) will not remain constant as observations improve], but the width 
of P{rj) will still behave in the same way. The precision with which we can measure ry is limited 
by the 25% uncertainty in p s o- Figure 3b demonstrates what would happen if p s o were known 
exactly. In this case, the statistics are purely Poisson and the width (and hence the height) of our 
normalized distribution scales as (m b s + l) 1 / 2 . 



5. Summary and Conclusions 

We have used the catalog of DLA systems to place constraints on the amount of evolution in 
the baryonic content of galaxies and the value of the cosmological constant. We compared the gas 
density p g at redshift z = 3 ± 1 to the present-day stellar mass density p s o in galaxies for a range 
of flat cosmologies with Oa + Q m = 1. Our underlying assumption is that cold gas is required 
for star formation. We make use of the facts that DLA systems dominate the HI content of the 
universe at redshifts z > 2 and the correction to their baryonic mass due to molecular gas is small, 
£ 20% (Ge k Bechtold 1997). 

We defined ry to be the net fraction of the baryonic content of local galaxies which was 
expelled since z ~ 3 ± 1, and used Bayesian inference to derive confidence regions in the (ry, Q,^) 
plane. In all cosmologies we find that ry < 0.4 with at least 95% confidence, as long as only < 25% 
of the current stellar population formed before z = 3. The most likely value of ry is negative (cf. 
Fig. 2), implying a net increase by several tens of percent in the baryonic mass of galaxies since 
z ~ 3. The inferred value of ry is more extreme for f^A-dominated cosmologies. On the other hand, 
recent observations of high metal abundances in the intracluster medium of rich clusters (Renzini 
et al. 1993; Loewenstein & Mushotzky 1996) require that metal-rich gas be expelled from galaxies 
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in an amount approximately equal to the current mass in stars. The possibility that a dominant 
fraction of the present-day stellar population may have already formed by z = 3 and resulted 
early on in this expulsion, is ruled out by the low metalicities (0.01-0.1 solar) observed in DLAs 
at z ^ 3 (Lu et al. 1996). Moreover, the intergalactic medium which later accretes onto clusters 
of galaxies has a metallicity as low as ~ 10 -2 solar at z £ 2 (Cowie et al. 1995; Tytler et al. 1995; 
Songaila & Cowie 1996; Cowie 1996), rather than the needed value of ~ 0.3 solar. Most of the 
required metal enrichment and star formation activity must therefore have occurred at z <J 2. 

The most likely explanation to the above discrepancy is that a significant amount of gas had 
been assembled and partly expelled from galaxies after z = 2. The increase in galactic mass could 
have been either in the form of accretion onto existing galaxies or through the formation of new 
galaxies, such as those responsible for the faint excess in deep galaxy counts (Lowenthal et al. 
1996, and references therein). The likely value of rj of minus several tenths (Fig. 2), implies that 
more than half the associated baryonic mass was processed through galaxies after z = 2. As an 
example, let us assume that r/ = —0.5 at z = 3 ± 1, and that 150% of the current baryonic mass of 
galaxies had assembled after z = 2. This implies that the total mass processed through galaxies 
is twice (150%+50%) their current mass, as required by the observation that clusters contain 
twice the iron locked up in stars (Renzini et al. 1993; Elbaz, Arnaud, & Vangioni-Flam 1995; 
Lowenstein & Mushotzky 1996). Half of the processed mass was converted into galactic stars and 
half expelled into the intergalactic medium. If the expelled gas is ~ 10% of the intergalactic gas it 
mixed with [assuming Ob ~ 5% and the value of p s o in Eq. (|2l|)1, then it could have yielded the 
~ 0.3 solar metallicity observed in clusters as long as its original metallicity was a few times solar. 
The phase of massive metal enrichment must have occurred at z ~ 0.5-2 since the iron abundance 
in clusters shows little evolution at z <J 0.5 (Mushotzky & Lowenstein 1997). This inference could 
be tested observationally through a dedicated search for enhanced star formation activity and 
supernova rate at z ~ 0.5-2. 

We have employed a Bayesian analysis which has the dual advantages of taking the various 
observational uncertainties properly into account, and making use of unbinned data. The 
constraints obtained with our method will improve as the size of the quasar sample increases (cf. 
Fig. 3). In particular, future spectroscopic observations of the ~ 10 5 quasars cataloged by the 
Sloan Digital Sky Survey ( http:/ /www. astro. princeton.edu/BBQOK|) , could increase the current 



sample size of DLAs by 1-2 orders of magnitude, and improve our limits on the amount of galactic 
evolution considerably. 
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Level 


/ = 0.00 


f = 0.25 


/ = 0.50 


/ = 0.25 


0.0 
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-0.30 
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-0.40 




95% 


-0.06 


+0.42 


+0.92 


-0.14 




99% 


+0.20 


+0.66 


+1.10 


+0.20 


0.7 


68% 


-0.60 


-0.32 


+0.06 


-0.54 




95% 


-0.36 


-0.06 


+0.30 


-0.40 




99% 


-0.10 


+0.18 


+0.54 


-0.20 



Table 1: Upper limits on rj for two cosmologies and different fractions / = p s (3)/p s (0) of stars 
formed at redshifts z = 3 + 1. The last column includes the mass density in the intracluster 
medium (ICM), assuming 25% of the total (stars+ICM) was in the form of stars at z = 3 ± 1. See 
Figure 2 for the differential probability distributions P(rj) corresponding to columns 3-6. 
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Fig. 1. — Confidence regions in the (Qa,v) plane, computed with dust obscuration. Results are 
shown for two redshift intervals over which the HI density is calculated: (a) 2 < z < 4; (b) 
2.5 < z < 3.5. 
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Fig. 2.— Probability distributions P(rj), for (a) ft A =0; (b) tt A = 0.7. We use the 2 < z < 4 
subsample. 5Wid curves: results for three different amounts of star formation before z = 3 ± 1, 
namely 0%, 25%, and 50% of the present-day stellar density. Dotted curves: including the mass 
density in the intracluster medium (ICM), assuming 25% of the total (stars+ICM) was in the form 
of stars at z = 3 ± 1. 
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Fig. 3. — Effect of increasing the DLA sample size m b s , (a) including the 25% observational 
uncertainty in the present-day stellar density p s o; (b) assuming we know p s o exactly. We use the 
2.5 < z < 3.5 subsample. 



